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This appendix describes the Integrated Communications Experiment (ICE), a 
comprehensive computer software capability developed for the ComField Project. 
Each major characteristic of the data processing system is treated separately: 
natural language processing, flexibility, noninterference with the educational process, 
multipurposeness, comprehensive retrieval, centralization of records and distribution, 
statistical analysis, long-range data management, editing capacity, automatic coding 
and uncoding, data accuracy, uniform data preparation, and computer independence. 
Included also are descriptions of the three major programing systems — file 
generation and translation, file management and editing, and file retrieval — which are 
considered a minimum set of software functions which would satisfy the total demand 
of a record system in the natural language. Two flow charts are appended: (1) 
schematic configuration showing one possible ComField system using a comprehensive 
s torage/retieval system such as IlE and (2) the ICE system including document 
description, translation, information retrieval, and file maintenance and editing. This 
document and SP 002 I55-SP 002 ISO comprise the appendixes for the ComField 
Model Teacher Education Program Specifications in SP 002 154. US) 
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THE INTEGRATED COMMUNICATIONS EXPERIMENT 
CICE) SUMMARY 

Robert Coffin 



In the majority of data processing facilities, each new 
application necessitates the investment of considerable time and 
expense before the originator of a project is able to benefit 
from the collective effort of professional computer personnel. 

Too often, this expense and delay has resulted in disillusion 
and disappointment on the part of educators who have little 
appreciation for the technical Implications of demands which 
they consider perfectly reasonable from their own points of view. 
Traditional problems such as grade and attendance reporting, 
payroll accounting and other administrative functions have won 
general acceptance largely because the benefits are tangible and 
easily justified in terms of costs. However, these programs 
were hopelessly inadequate for processing information pertaining 
to the more intricate and critical problems of constructive eval- 
uation of student performance, maintaining libraries of 
curriculum alternatives for individualized instruction, and 
analysis of subjective student data as collected by teac h ers, 
counselors, and special service workers. These problems, diverse 
in nature and continuous in tine, had to be bound together in a 
common approach in such a manner that problems and situations, 
even though unanticipated, could be accofjmodated without large, 
additional expenditures of time and money. The landmark ^ 

decision to develop a comprehensive computer software capability^ 
was made in the Department of Educational Research in 1965; and 
under financial support of the Hill Family Foundation of * 

St. Paul, the Integrated Communications Experiment (ICE) was - 
designed and started in the spring of 1966. The conceptual model 
for the ICE system existed as successively operating medical 
record system in the Portland area*, consequently the designers 
of the ICE system benefited from a huge investment in a general- 
ized information storage and retrieval system developed in a 
related environment. 

An underlying promise of the ICE system was that the filing 
system, from data collection to process i-g final analysis 



*The fundamental concept herein presented is based upon wor* 
conducted at the Oregon Regional Primate Research Center upon 
Contract to the National Institution of Health, Institute of 
Child Health and Development. (Contract PH43-64-1163) . 
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had to be implemented so that the user, teacher, supervisor, or 
researcher could continue to record information in his ohi lang- 
uage. This decision was forced by the simple expedient that to 
gain the acceptance and cooperation of the user, it is not pos- 
sible to require him, except in rare instances, to code Informa- 
tion or adopt any recording method which is fundamentally alien 
to their traditional habits. In general, the lang u age of the 
user dictates the form of the information input whether entirely 
verbal, entirely quantitative, or a mixture of quantitative and 
verbal data. To this objective, the ICE system allows the user 
to freely define his document to suit his own subject area, Am 
easy- to -learn notation is used to describe a docu m ent for m a chine 
processing. An example of a typical input document is the 
Remedial Reading Record shown on Page ”5”. The fixed information, 
called the Header, serves to identify the record in the master 
file. The remainder of the record, the variant, is arrayed in 
outline form to reflect the breadth of subject matter of Inte n t 
to a particular user in a specialized application. From the view 
of programming economy, it is important to note that any document 
arrayed in a similar manner may be accommodated in the ICE system 
without further programming effort. At the present time, several 
dozen information documents have been designed and are in use in 
the Portland Public School System. 

A comprehensive retrieval system is currently being imple- 
mented to provide the user with a variety of types of ou tput . 

User requests tend to fall into three broad categories. Hie most 
frequent requests are for sets of raw records which have a speci- 
fic set of characteristics and serve to answer questions of 
transient importance — raw records may be red u c e d to take the 
form of reports which cover particular cross-sections of the 
master file. A further refinement is statistical analysis tdrich 
may be obtained by linkage of the ICE master file to existing 
statistical programs. 

Important auxiliary functions offered by the ICE system aid 
in overcoming many of the disadvantages of variable content, 
variable length record keeping. Capacity for editing of mis- 
spellings has provided a means for improving the quality of text, 
removal of ambiguities and standardization of descriptive entries. 
Other editing and file maintenance routines give the computer 
personnel control over the data files in order to insure that 
data is recorded and reproduced with lailuu accuracy. 

The following characteristics of the ICE system are pr e s e n te d 
as the minimum conditions to satisfy the total system requirements 
for ComField. 
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Flexibility 



Any records system which will withstand the tremendous 
pressure of time must have a capacity to change through time. 
Inherent in this statement is the fact that a record system can 
be neither rigid in subject matter that it accommodates nor lim- 
ited in the diversity of that subject matter. The user must have 
the ability to vary the content of records as the need dictates. 
Further, there may be several kinds of information contained in 
the same file. In rigidly defined data collection systems, it is 
possible to define an entry on a particular card which is assumed 
to have certain limits and it is not likely to change. However, 
in the educational environment there are very few appropriate 
subjects in which rigidly defined data collection systems are 
meaningful. Experience has shown that information changes dyna- 
mically, thereby making any rigid planning obsolete in a very 
short period of time. 

Economic considerations of the rigid versus the flexible 
system are very clear. Natural Language Processing permits more / 
dynamic expression and communication of information within prac- 
tical economic limits. In the educational environment it is not 
possible nor desirable to have a large staff of highly salaried, 
skilled programmers in order to facilitate continual change. If 
a gen eral record system could be implemented which would allow a 
wide variation in content and complete capacity to growth through 
time, and if such a system could be implemented and turned over 
to a small data processing group for operation, then the scope of 
an automated records system becomes quite feasible. 



Noninterference with the >> Educational >> Process 

It is not realistic to assume that data processing personnel 
can impose a drastic change upon educational personnel without 
having a great deal of friction develop which would lead to a 
state of noncooperation and decline of the records effort. Per- 
haps the most important feature offered by Natural Language Pro- 
cessing is that of reducing the problem of interfering with the 
"educational process." This is to say, it is possible merely to 
substitute one piece of paper for another in order to implement 
the data collection scheme. The teacher, special personnel, or 
administrator need not undergo a lengthy process of orientation, 
familiarize himself with a particular coding scheme, or write in 
any manner which is foreign to him. Data processing may then 
become a harmonious part of the long established educational 
system. Natural Lang uage Processing can accomplish this end. 
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Mul t ipnrpo sene sa 



It has been considered impractical to use information only 
once. If information is in the natural language of its user, a 
multiplicity of use can be easily achieved. Further, it has been 
considered impractical to maintain separate record systems for 
separate purposes within small institutions. A record system 
must serve groups of diverse interests in a unified manner w c 
does not interfere with the normal processes of each group. 
Multipurpose use of records is mandatory in the educational env 
ronment. Natural Language Processing offers a unified approach 
to record keeping and at the same time can serve many different 

people. 



Natural Language Processing does not exclude the use of 
coding schemes since any coded system is a subset of the natura 
language. Coding schemes are encouraged whenever the user feels 
they will provide more effective use of information. Natural 
Language Processing provides the user in the educational environ- 
ment flexible, multipurpose storage and communication of informa- 
tion with the least interference with the "educational process. 
There are twelve principal features that should be considered in 
the programming structure of a natural language system. These 
features are viewed as desirable for any filing system. 



Flexibility 



The basic programming structure must contain the ability to 
translate and process variable length, variable content records 
in order to provide the system with flexibility. The integr y 
(content, length) of a record must be maintained throughout all 
processing. The programming system must be able to accept new 
subject areas that were not anticipated in the initial sys e 
design. Change cannot be predicted, so the ability to change must 

be an integral part of the system. 



rnmpatibilitv with the " Educational" Process 

In order to gain general acceptance, the programming system 
should preserve the Integrity of existing source documents and 
reports. The most important single feature that can be offered 
in terms of gaining acceptance of the system is simp y 
that it is possible to substitute one piece of paper, which is 
much like the one it replaces, for another. Further, t ere mus 
be the ability to design a source document which can accommodate 
the material which is to be recorded without introducing any bias 



235 



into the information. The amount or detail of information should 
be left up to the user. 



Comprehensive Retrieval System 

In order for a records system to serve many different groups 
of personnel, it is necessary to have a broad and comprehensive 
retrieval system which anticipates general requirements of each 
group. Just as it is not possible to anticipate the breadth nor 
scope of the information content, neither is it possible to anti- 
cipate the breadth and scope of retrieval functions which will be 
required of the system. Therefore, a very generalized and flex- 
ible retrieving system is required to handle the many different 
demands for information. 



Centralization of Records 



The system must provide for centralization of record keeping 
in a smooth flowing data collection and processing scheme which 
allows for management of bottlenecks, data flow, and currency of 
subject information. Centralization of records is desirable, pro- 
vided that it can be achieved within practical limits. The system 
eliminates many limiting factors characteristic of hand systems, 
but does not replace complete systems planning. Most often records 
are decentralized because it is too cumbersome to generate and 
maintain a unified, centralized system. 



Centralized Distribution 



A common complaint of centralized record systems is the 
inability for adequate distribution of records. In centralized 
hand record systems, records are continually being handled causing 
wear and jeopardizing the quality of the file. The availability 
of records is never certain and one must put forth special effort 
to obtain them. In a computerized system, the source documents s 
are never removed for purposes of distribution; consequently, the 
user is assured of continued high speed availability. 



Statistical Analysis 

From the view of the researcher, an individual record may be 
of particular interest, but more broadly he is interested in popu- 
lations of records which share certain features. It is therefore 
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necessary to supply to the user the ability to extract in meaning- 
ful arrays the basic documents in a form suitable for statistical 
analysis . 



Long-Range Data Management 

The initial system planning must include provision for 
records which have accumulated through time and which are of 
potential value, but which must be considered inactive. This, 
plus other data management considerations (sort, merge) must be 
provided for by the basic programming system. 



Editing Capacity 

To a large degree the success of a filing system will depend 
on the ease and naturalness with which errors can be corrected. 

An editing system should be incorporated to first identify and 
categorize all types of errors, and secondly to provide a conven- 
: .int and natural mechanism to edit these errors. It is necessary 
to maintain a constant vigilance on the quality of material that 
comes into the files. Provisions should be made to identify not 
only misspellings, but also ambiguities and word equivalences. 



Automatic Coding and Uncoding 

The capacity to perform generalized editing has a secondary 
benefit which enables an investigator to arrive at a satisfactory 
coding structure for his own information after he has had some 
experience in collecting and recording his data. It is not 
necessary for the user, who may be naive in terms of machine 
analysis and data collection, to decide upon a coding structure. 
Rather, it is possible for him to collect data in his own natural 
stride and at a later time identify a possible structure which 
will enable him to rapidly and concisely analyze his information. 



Data Accuracy 

Basic to data accuracy is a fundamental concept that one and 
o nl y one recording of the source information can take place. In 
many systems it has been necessary to xerecord and recopy by hand 
various portions of recorded information before it is prepared for 
the machine. This results in an unacceptably high error rate and 
corresponding low data quality. The programming system must pre- 
serve absolute accuracy of data. 



237 




Uniform Data Preparation 



In order to reduce error in data preparation , the data 
collection sheets should be structured in such a way as to main- 
tain consistent appearance to the personnel preparing the data, 
and once learned, a new subject area with its corresponding data 
sheet need not be a separate object of instruction. Without this 
uniformity the preparation is difficult, tedious, and increases 
the probability of error. 



Computer Independence 



Natural Language Processing as a philosophy of record keeping 
is absolutely machine independent; that is, the system planners 
may consider the scope and the nature of the problem relative to 
any suitable computer configuration. Of course, at the time it 
becomes an operating system, the philosophy and system features 
must be reduced to a set of machine instructions which then 
become machine dependent. If the implementation is consistent 
with the philosophy of machine independence, the user can expect 
to benefit from vertical and horizontal machine communication. 

The following list of major programming systems and their ^ 
brief description are viewed as a minimum set of software func- 
tions which would satisfy the total demand of a records system in 
the natural language. 



File Generation and Translation System 

This eat of programs allows for the initial translation of 
records into the master file. During the translation process each 
record is subjected to a series of tests which detect gross errors 
by comparing the incoming record with an on-line dictionary which 
contains an image of each type of source document. Erroneous 
records are not merged with the master file, but are printed to- 
gether with a diagnostic statement for subsequent correction and 
reprocessing. The system also must provide capacity for the defi- 
nition of new source document images, and modification of existing 
images. 

Each record must have an identifying or header portion which 
determines its unique position in the master file. One scheme 
might be to structure each individuals file as a chronology. 

While the header portion is fixed, the other portion is variable. 
The translator must perform interpretive operations on the 
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variable portion of the record. The interpretation of a record is 
controlled by the dictionary image which giyes the translator its 
flexibility. The translator can be designed to accept any type 
of input media (cards, paper tape) and it is the purpose of the 
translator to check any error considerations characteristic of 
each media. 



File Management and Editing System 

Once a file has been created there is a problem of mainte- 
nance. The life and value of a file may depend on a flexible 
mana gement system which must have the ability to delete records 
from a file, to dynamically combine files and to condense in- 
active records for economical storage. It is further desirable 
to perform editing functions within a file. The editing system 
provides the ability to edit errors and examine text. As men- 
tioned above, spelling errors and inconsistent language detracts 
from the value of information. This programming system will 
improve quality and longevity of files. 



File Retrieval System 



This complex set of programs must provide the user with 
adequate communication to the master file to effectively identify 
and generate information. The retrieval system functions like a 
compiler in that it interprets a retrieval statement and compiles 
a program which screens each record as it is received into memory. 
A retrieval statement must be checked for validity against the 
dictionary tape. Basically there are three classes of retrievals. 
Retrieval of original records will occur most frequently in which 
the user must have the ability to retrieve all, or part, of an 
individual’s file. Secondly, there is a need to create subfiles 
which will be used in connection with report generation programs. 
The report generated by these programs can be very helpful in 
analyzing the information contained in the master file. It is 
also possible to freeze the status of the master file by creating 
subfiles. Finally, it is desirable to link the master file with 
other programming systems which deal with fixed length records 
such as FORTRAN or COBOL, and thereby gives the user the ability 
to link the master file with any existing programs. 

Other programming systems can be added to make effective use 
of the master file. If the master file is contained on a random 
access storage device, it is desirable to have an automatic in- 
dexing system. Further, it is highly desirable to have a textual 
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analysis systen which analyzes content and syntax of word 
groupings. 

In suflflary, Natural Language Processing has seyeral advan- 
tages which sake it desirable for use in the educational environ- 
flent. Only through extensive systems considerations and planning 
will the educator realize any benefit. The programming systems 
provide an effective tool. It is the responsibility of the 
educator to develop the content and skill necessary to use the 
tool. 
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SCHEMATIC CONFIGURATION SHOWING ONE POSSIBLE COMFIELD SYSTEM USING 
A COMPREHENSIVE STORAGE/RETRIEVAL SYSTEM SUCH AS ICE 
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